|
|
Accession Number |
TCMCG075C05891 |
gbkey |
CDS |
Protein Id |
XP_007042822.1 |
Location |
complement(join(6535671..6535811,6536609..6536720,6537036..6537118,6537258..6537554,6537969..6538244,6538340..6539071,6539730..6540335)) |
Gene |
LOC18608196 |
GeneID |
18608196 |
Organism |
Theobroma cacao |
|
|
Length |
748aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007042760.2
|
Definition |
PREDICTED: KH domain-containing protein HEN4 isoform X2 [Theobroma cacao] |
CDS: ATGGGCAGCACCTTCCTCTCTATACCAACAAAGCGAGCCATGCCCGACGCGACCCCTTCTTCGAACGGCCCCTCCAAGCGTTCCAAGCCTCCGGCTACTCCTCTCCCTGTCCCCCCTGGTCACGTTGCCTTCCGTCTCCTCTGCCACGTGTCCCGTGTCGGTGGCGTCATCGGCAAGTCAGGCAGTGTCATCAAGCAACTGCAACAGGCTACCGGTTCCAAGATCCGAATTGAGGATGCCCCGGCTGAAAGCCCGGACCGGGTTATTACCGTCATAGGCCCGAATGCTGTTAATACCAAGATTGTACTGAATTATGGTAGCCTTGGCAATGGTTACGGTAGTAGCGTTGAGGAAATCGATGTGTCCAAGGCGCAGGAGGCGTTAGTGAGAGTGTTCGAGAGGATTCTGGAGGTGGCGGCGGAGAGCGATGGAGTGGCCTTGGTGATGGTTTCTTGTCGGTTATTGGCGGAGGTTAAGCACGTTGGGAGCGTGATAGGGAAAGGAGGTAAGGTAGTGGAGAAGATAAGGGAAGATACTGGGACCAAAATTAGGGTTTTGACGGATAAGCTACCGGCTTGTGCCAGCCCCACGGAGGAGATTGTGGAGATTGAAGGAGGTGTTTTAGCTGTAAAGAAAGCGCTTGTTGCTGTCTCACATCGCCTCCAAGATTGCCCTCCTGTCAATAAAACAAGGATAACTGAAAACAGGATCATTGAATCAGTTCCTTCAGAGGCTTGGCATAAACCTATTGAGTTACTTCCTCAGGAGACTTTGCGAAGGCCTATTGACTTATTTCCCCAGGACACTTTGTACAGGCCTATTGACTTACTTCCTCAGGAGACTTTGCGCAGAGCTATTGAGGTACTTCCCCAGGAGACTTTGCACAGACCTATTGAGGTTGTTCCACAGGAGCCATTGCACAGACCTATTGATGTTGTTCCACAGGGCTCCTTGCGTAGACATATTGATGTTGTTCCACAGGGCTCCTTGCGTAGACCTATTGATGTTGTTTCTCAGGAGGCTTTACCTGATCTGAATATAGATCATCTTTCACAGCGTAGTTCCCTGATGCCTACTATATCCAGTAGCTCCATCAGTTATGCCACCAGAGTTCATCCTTTGTCTCTAGAGTCCGAGAATGCTTCTCCATTGGATACAAAAACATTGCAGCATGAAGTGGTTTTTAAAATTCTTTGCTCCAGTGATAGGGTTGGGGGTGTTATTGGAAAGGGAGGTGCAATCATTAAGGCTCTTCAAAGTGATACAGGAACTACTATTACCATTGGACCTACACTCACTGATTGTGATGAACGGTTGGTAACTGTTACTGCATCAGAGAACCCAGAATCACAGTATTCTCCAGCACAAAAGGCTGTTGTGCTTGTTTTTGTAAGAGCTTTGGAGGCGTCAATTGAAAAAGGGCTAGATTCAGGCTCAGGTAAGGGTTCAAATGTCACAGCTCGGCTTGTAGTTCCATCAGGCCAAGTTGGCTGTCTGTTGGGAAAAGGAGGTGCAATAATTTCTGAAATGCGTAAAGTGACTGGTACCGGCATTCGAATTTTGGGATCTGACCAGGTCCCTAAGTGTGTCACTGAAAATGACCAAGTGGTGCAGATTTCAGGAGGGTATTTGAATGTGAAAGATGCTATATATCATGTTACTGGTAGACTACGAGATAACCTATTTTCTAGCACACTGAAGAATGCTGGAGCAAAAAGTAGTTCTGCTGTTTTAACTGAGACCAGTCCTTATGAAAGATTGATGGACACTGCCCCTCTTGGGCTGCAAGTATCAAGTGGTGTTTCTTATAATCTTAGTCGGCATACGACATTGGCACCGAATAGTACGGATTCCTTTGGACTTTCCCGTAGTTTAGATTGCCCTCATTCACCAGGGTTATGGACATCAGAGACAGGTAATGTACTGAATCCAAGGAGCACCACAGATATCGGCAGAGGATTGACTTCTCTTAGAGGTGGATTTGAACTTGGCAGTGGAAACAGATCTGCTATTGTGACAAATACAACTGTAGAGATTAGAGTTCCTGAGAATGTTATTGACTCTGTTTATGGGGAGAATGGTCGCAATCTGTCTCGGTTGAGAGAGATCTCTGGTGCCAAGGTCATAGTGCATGAACCTCAAATAGGAACAAGTGACAGGATTGTTGTCATATCTGGGACACCTGATCAAACCCAGGCGGCTCAGAGCCTCCTTCAAGCTTTCATCCTCACTGGTCCATCACGTTGA |
Protein: MGSTFLSIPTKRAMPDATPSSNGPSKRSKPPATPLPVPPGHVAFRLLCHVSRVGGVIGKSGSVIKQLQQATGSKIRIEDAPAESPDRVITVIGPNAVNTKIVLNYGSLGNGYGSSVEEIDVSKAQEALVRVFERILEVAAESDGVALVMVSCRLLAEVKHVGSVIGKGGKVVEKIREDTGTKIRVLTDKLPACASPTEEIVEIEGGVLAVKKALVAVSHRLQDCPPVNKTRITENRIIESVPSEAWHKPIELLPQETLRRPIDLFPQDTLYRPIDLLPQETLRRAIEVLPQETLHRPIEVVPQEPLHRPIDVVPQGSLRRHIDVVPQGSLRRPIDVVSQEALPDLNIDHLSQRSSLMPTISSSSISYATRVHPLSLESENASPLDTKTLQHEVVFKILCSSDRVGGVIGKGGAIIKALQSDTGTTITIGPTLTDCDERLVTVTASENPESQYSPAQKAVVLVFVRALEASIEKGLDSGSGKGSNVTARLVVPSGQVGCLLGKGGAIISEMRKVTGTGIRILGSDQVPKCVTENDQVVQISGGYLNVKDAIYHVTGRLRDNLFSSTLKNAGAKSSSAVLTETSPYERLMDTAPLGLQVSSGVSYNLSRHTTLAPNSTDSFGLSRSLDCPHSPGLWTSETGNVLNPRSTTDIGRGLTSLRGGFELGSGNRSAIVTNTTVEIRVPENVIDSVYGENGRNLSRLREISGAKVIVHEPQIGTSDRIVVISGTPDQTQAAQSLLQAFILTGPSR |